Speech-Music Discrimination from MPEG-1 Bitstream

نویسنده

  • ROMAN JARINA
چکیده

This paper describes a proposed algorithm for speech/music discrimination, which works on data directly taken from MPEG encoded bitstream thus avoiding the computationally difficult decoding-encoding process. The method is based on thresholding of features derived from the modulation envelope of the frequency-limited audio signal. The discriminator is tested on more than 2 hours of audio data, which contain clean and noisy speech from several speakers and a variety of music content. The discriminator is able to work in real time and despite its simplicity, results are very promising. Key-Words: audio, video, classification, speech, music, signal processing, MPEG

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Digital Watermarking of Mpeg - 2 Coded Videoin the Bitstream

IN THE BITSTREAM DOMAIN Frank Hartung Bernd Girod Telecommunications Institute University of Erlangen-Nuremberg Cauerstrasse 7, 91058 Erlangen, Germany [email protected] Proceedings International Conference on Acoustics, Speech, and Signal Processing (ICASSP 97), Vol. 4, pp. 2621{2624, Munich, April 1997. ABSTRACT Embedding information into multimedia data, also called waterm...

متن کامل

A comparative study in automatic recognition of broadcast audio

This paper provides a thorough description of a methodology which leads to high accuracy as regards automatic analysis of broadcast audio. The main objective is to find a feature set for efficient speech/music discrimination while keeping the number of its dimensions as small as possible. Three groups of parameters based on Mel-scale filterbank, MPEG-7 standard and wavelet decomposition are exa...

متن کامل

An Experiment in Audio Classification from Compressed Data

* This work was completed while the first author was with Dublin City University Abstract – In this paper we present an algorithm for automatic classification of sound into speech, instrumental sound/ music and silence. The method is based on thresholding of features derived from the modulation envelope of the frequency limited audio signal. Four characteristics are examined for discrimination:...

متن کامل

Design and implementation of a DSP based MPEG-1 audio encoder

The speed of current PCs enables them to decode and play an MPEG bitstream in real time. The encoding process, however, cannot be done in real-time. The purpose of this thesis is to produce a low-cost real-time Digital Signal Processor (DSP) implementation of an MPEG encoder. The DSP will provide an MPEG bitstream to the PC that can be saved to disk. The input to the DSP will be an analog audio...

متن کامل

Digital watermarking of MPEG-2 coded video in the bitstream domain

Embedding information into multimedia data, also called watermarking, is a topic that has gained increased attention recently. For video broadcast applications, watermarking schemes operating on compressed video are desirable. We present a scheme for robust watermarking of MPEG-2 encoded video. The watermark is embedded into the MPEG-2 bitstream without increasing the bit-rate, and can be retri...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001